The Efficacy of GlOSS for the Text Database Discovery Problem

Authors

  • Luis Gravano
  • Anthony Tomasic
Abstract

The popularity of information retrieval has led users to a new problem: finding which text databases (out of thousands of candidate choices) are the most relevant to a user. Answering a given query with a list of relevant databases is the text database discovery problem. The first part of this paper presents a practical method for attacking this problem based on estimating the result size of a query and a database. The method is termed GlOSS (Glossary-of-Servers Server). The second part of this paper evaluates GlOSS using four different semantics to answer a user's queries. Real users' queries were used in the experiments. We also describe several variations of GlOSS and compare their efficacy. In addition, we analyze the storage cost of our approach to the problem.
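The abstract does not spell out how the result size is estimated. As an illustration only, the following minimal sketch assumes that each database exports its total document count and a per-term document frequency, that queries are conjunctions of terms, and that terms occur independently of one another. The function names and the sample figures are hypothetical and are not taken from the paper.

```python
from math import prod


def estimate_result_size(db_size, term_doc_freqs, query_terms):
    """Estimate how many documents in one database match ALL query terms.

    Assumption (not stated in the abstract): terms occur independently,
    so the matching fraction is the product of the per-term fractions.
    """
    if db_size == 0:
        return 0.0
    fractions = (term_doc_freqs.get(term, 0) / db_size for term in query_terms)
    return db_size * prod(fractions)


def rank_databases(summaries, query_terms):
    """Return (database name, estimate) pairs, best first.

    `summaries` maps a database name to (document count, {term: doc frequency}).
    Databases with an estimate of zero are left out of the answer.
    """
    estimates = [
        (name, estimate_result_size(size, freqs, query_terms))
        for name, (size, freqs) in summaries.items()
    ]
    useful = [pair for pair in estimates if pair[1] > 0]
    return sorted(useful, key=lambda pair: pair[1], reverse=True)


# Hypothetical summaries for three databases.
summaries = {
    "db_a": (100, {"text": 50, "database": 20}),
    "db_b": (1000, {"text": 10, "database": 10}),
    "db_c": (10, {"text": 5}),
}
ranking = rank_databases(summaries, ["text", "database"])
```

Under these assumptions the ranking needs only the compact per-database summaries, not the documents themselves, which is why storage cost is a natural thing to analyze for such an approach.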

Related resources

The Effectiveness of GlOSS for the Text Database Discovery Problem

The popularity of on-line document databases has led to a new problem: finding which text databases (out of many candidate choices) are the most relevant to a user. Identifying the relevant databases for a given query is the text database discovery problem. The first part of this paper presents a practical solution based on estimating the result size of a query and a database. The method is termed GlOSS ...

Precision and Recall of GlOSS Estimators for Database Discovery

On-line information vendors offer access to multiple databases. In addition, the advent of a variety of INTERNET tools has provided easy, distributed access to many more databases. The result is thousands of text databases from which a user may choose for a given information need (a user query). This paper, an abridged version of [...], presents a framework for, and analyzes a solution to, this problem which ...

The Effects of L1 and L2 Glossing on the Retention of L2 Vocabulary in Intentional and Incidental Settings

The current study investigated the effects of L1 and L2 glosses on L2 vocabulary retention in incidental and intentional settings. To this end, 100 intermediate Iranian female learners of English as a foreign language at Soroosh High School were given a pre-test to make sure that they did not have any prior knowledge of the target words. Reading passages with three different glossing conditions ...

The Effects of Glossing Conventions on L2 Vocabulary Recognition and Production

To investigate the effects of different glossing conventions on vocabulary recognition and recall, 158 participants were given a pre-test to make sure that they did not have any prior knowledge of the target words. Reading passages with four different glossing conventions (interlinear, marginal, pre-text, and post-text) were given to eight groups. Four groups received interlingual glosses and f...

A Model for Extracting Information from Textual Documents, Based on Text Mining in the Field of E-Learning

As computer networks become the backbones of science and the economy, enormous quantities of documents become available. Text mining techniques have therefore been used to extract useful information from textual data. Text mining has become an important research area that discovers unknown information, facts, or new hypotheses by automatically extracting information from different written documents. T...

Publication date: 1998